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Abstract 



Specific interactions between cations and proteins have a strong impact on peptide and 
protein structure. We here shed light on the nature of the underlying interactions, especially 



> 

(^ regarding the effects on the polyamide backbone structure. To do so, we compare the con- 

.^ formational ensembles of model peptides in isolation and in the presence of either Li"*" or 

o 

^r^ Na"*" cations by state-of-the-ait density-functional theory (including van der Waals effects) and 

ILJ gas-phase infrared spectroscopy. These monovalent cations have a drastic effect on the local 



S^ backbone conformation of turn-forming peptides, by disruption of the H bonding networks and 

the resulting severe distortion of the backbone conformations. In fact, Li"*" and Na"*" can even 
have different conformational effects on the same peptide. We also assess the predictive power 
of current approximate density functionals for peptide-cation systems and compare to results 
from established protein force fields as well as to high-level quantum chemistry (CCSD(T)). 
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Introduction 

As early as 1912, Paul Pfeiffer systematically studied the crystallization of short Ala and Gly 
containing peptides from aqueous solution in the presence of alkali salts'^ and postulated that Li"^ 
exhibits a higher affinity ("Additionsfahigkeit") to peptides than Na"*" and K"'".'21 Indeed, calorimetric 
studies revealed high interaction enthalpies of a series of peptides with Li''",Elin the range of solva- 
tion enthalpies of peptides. These strong interactions are in practice used to increase the share of 
cis prolyl peptide bonds from 10% to 70% upon addition of Li"'"-salts in biochemical assays of the 
activity of peptidyl prolyl cis-trans isomerases."^^ NMR studies of the cyclic peptide Cyclosporin 
A (CysA) in organic solvents revealed that Li"*" inhibits the formation of H bonds and induces 
'unusual' backbone conformations. I^SI 100 years after Pfeiffer's work, Garand et alM^ studied the 
non-covalent interactions of a non-natural peptide-based catalyst by means of gas-phase infrared 
(IR) spectroscopy. While the polyamide backbone of the molecule forms intramolecular H bonds 
if protonated, sodiation apparently results in a complete absence of H bonds due to the involvement 
of the carbonyl groups in interactions with the Na"*" cation. Both CysAlSEI and the peptide-based 
catalyst!^ form narrow turn-like backbone loops, well suited to accommodate a cation. Such turns 
are normally at the outside of globular proteins, exposed to the surrounding medium. We here in- 
vestigate the atomistic and electronic basis of cation peptide interactions in turn-forming peptides. 
The focus is on proline-containing peptides, where pronounced conformational effects of such in- 
teractions can be expected due to the possible cis and trans states of the prolyl-peptide bond.l^ 
Our study is based on accurate conformational predictions by first-principles (density-functional 
theory) in a synergistic combination with gas-phase IR spectroscopy to validate the results. 

Structure formation and dynamics in proteins can be primarily attributed to the rotation of the 
N-Ca and Ca-C single bonds, represented by the backbone torsion angles ^ and i//, respectively 
(Figure lA). This conformational (jy/x}/ space is well described by a Ramachandran diagraml^l like 
the statistical evaluation of high-resolution X-ray data^^l shown in Figure IB. The blue shaded 
areas are referred to as allowed conformational regions and can be associated with characteristic 
secondary structure types (Figure IB). 



The double bond character of the peptide bond hinders free rotation and allows for two distinct 
conformations. In general, the trans conformation with an apparently high barrier to cis is almost 
exclusively observed.'^^ni ^ significant fraction of cis conformation is only observed for the prolyl- 
peptide bond.'IS In proline, cis and trans (Figure IC) are close in energy since the Cj8 of the 
preceding building block encounters a carbon atom of proline (Ca or C5) in both states. A cis 
peptide bond, usually preceding a proline building block, is a feature of so-called type jSV/ turns 
(Figure 1D).'SIE] x^ig notation dates back to Venkatachalam, according to which j8 -turns share 
the feature of an H-bond between residues ?' + 3 — )■ i and are further classified by the backbone 
torsion angles ^ and i// of the residues i + 1 and i-\-2}^ The /3 turns of the protein backbone allow 
for a 1 80° reversal of the direction of structure propagation within four consecutive residues of a 
polypeptide chain. Similarly, Hutchinson and Thornton classify j8 -turns according to ranges for 
the backbone torsion angles ^ and ^f into eight well-defined classes (/, /', //, //', Vial, Vial, VIb, 
and VIII) and a miscellaneous type /V.^^^USI Very prominent are the common (type /) and Glycine 
(type //) turn and their inverse counterparts /' and //'. The special /3-turn types Via and VIb have a 
cis peptide bond between central residues i+ 1 and i + 2; they frequently feature proline in position 
i + 2. ^^ Idealized backbone torsion angles for these turn types can be found, e.g., in referencefTSl 

In this study, we make use of the characteristic of proline-containing peptides to allow for 
the formation of cis and trans peptide bonds as a potential strong "conformational signal" trig- 
gered by the peptide cation interaction. Indeed, Seebach and co-workers reported ion-induced 
conformational effects on peptide structure to be especially pronounced in the proximity of pro- 
line. Kunz et al. investigated a systematic series of proline-containing peptides with NMR and 
find peptides containing an Asp-Pro sequence to exhibit a cisltrans ratio that is in opposition to 
the results of all other sequences studied. ^^ Therefore, we here investigate the sequence AXPA 
(Figure ID), where P is the single letter code for proline and X is either alanine (A) or aspar- 
tate (D), allowing us to differentiate between pure cation backbone interaction and the additional 
effect of cation sidechain interactions on the peptide backbone conformation. The peptide de- 
sign in the present work avoids structure-perturbing labels and protects the termini with acetyl 
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Figure 1 : The backbone torsion angles ^ and Xjr (A) of the residues of a polypeptide chain can be 
illustrated by a Ramachandran plot (B), based on data from reference. '^ Labels highlight charac- 
teristic secondary structure types: The j8 region in the 2nd quadrant, the 3 10 and a helical region 
in the 3rd quadrant as well as left-handed a and the j8n' region in the 1st and 4th quadrant, 
respectively. The cis and trans state of the prolyl-peptide bond (C) has a drastic effect on polypep- 
tide structure, which we study with the model peptides AAPA and ADPA (D), here shown in a 
schematic /3y/-turn conformation. 



and aminomethyl groups (Figure ID) to embed the sequence in a 'protein-like' chain structure, 
avoiding endgroup effects, i.e., zwitterion formation. 

Results and discussion 

We combine an exhaustive conformational search from first principles with theoretical and exper- 
imental gas-phase IR spectroscopy. Such investigations of isolated peptides in the gas phase offer 
an unbiased view of structure formation trends intrinsic to the molecule, a successful strategy for 
charged and uncharged amino acids and peptides. ^'^"^^ By the stepwise addition of perturbing con- 
tributions, in this case cations, we aim to reconstruct the main contributions to protein secondary 
structure formation in a bottom-up approach. The success of such an approach is critically linked 
to the quality of the description of the potential-energy surface of the system under investigation. 
We employ density-functional theory (DFT) in the generalized-gradient approximation with the 
Perdew-Burke-Emzerhof (PEE) functional.!^ Van der Waals dispersion interactions are included 
through a pairwise C(,R^ term for which Cg coefficients are derived from the self-consistent elec- 
tron density, referred to as PBE-i-vdW.'22l Our use of rather accurate, but computational efficient 
approximate DFT is justified by the high-level benchmarks we present in the methods section of 
this article. 

Conformational analysis 

The theoretical conformational analysis of the short peptide AAPA (Figure ID) is already challeng- 
ing. Hypothetic ally, discretizing the backbone torsion angles with a 30 degree grid and assuming 
two possible states (cis and trans) for the peptide bonds would formally result in roughly 35 mil- 
hon conformations to evaluate. In order to tackle this massive conformational space, we resort to 
a basin hopping-like exhaustive search of the potential-energy surfaces (PES) of conventional pro- 
tein force fields (OPLS-AAl^or AMBER9#31). We employ the TINKER 5 scan routinely in an 
in-house parallelized version. In order to achieve a reliable and parameter- free description, we then 



follow up with a large set (700 to 1800 per peptide-cation system) of PBE+vdW post-relaxation 
calculations as a second computational step. 

Figure 2 shows our results for AAPA in isolation. The lowest-energy structure of the PES, 
a PVI turn with a cis prolyl peptide bond, also has the lowest free energy in the harmonic ap- 
proximation. Two alternative /3V/ turns are 4.5 and 8.3kJ/mol higher in AF^qqk. The most stable 
conformer with a trans peptide bond is a /3//' turn with A£'=2.8kJ/mol. Harmonic free energy 
contributions add a further penalty to the structure, yielding AF3oo/f=8.8 kJ/mol. In these cases, the 
maximum number of four backbone H bonds is formed. In a DFT study of Ac-Ala-Pro-NMe, Byun 
et al. also predicted a /3V/-turn as the most stable conformer in the gas phase. ^^ A comparable 
j8//' turn was not among the lowest-energy conformers of this shorter peptide. The lowest minima 
of the PES of ADPA (up to 0.7 kJ/mol) are again j8 V/ turns (Figure 3), followed by two alternative 
conformers with relative potential energies of 2 and 4 kJ/mol, respectively. In the latter, the Asp 
sidechain forms H bonds to the NH and C=0 groups of residue Ala4, resembling the shape of a 
j8-turn, hence the naming as SC-j8. For the ADPA-cation systems, we confirmed by mass spec- 
trometry (for the ADPA-cation systems) that the Asp sidechain is protonated in our experimental 
setup. Consequently, the Asp sidechain is modeled in the protonated neutral state. Harmonic free 
energy contributions make SC-/3 the preferred structure type by roughly 2kJ/mol. Notably, the 
lowest free energy structure of AAPA features a cis prolyl-peptide bond, while the respective bond 
in the lowest free energy structure of ADPA is trans configured (Figures 2 and 3). 

The attraction between backbone carbonyl groups and the Li"*" or Na"*" cations induces struc- 
tures that differ substantially from the conformers without cations: The H bonding networks in the 
low energy conformers are disrupted (Figures 2 and 3) and the backbone conformations deviate 
from the isolated case. This is in line with the above mentioned results for CysA in apolar Li 
salt solutions^ ^ and the sodiated peptide-based catalyst in the gas phase. ^ For the isolated pep- 
tides AAPA and ADPA, the backbone torsion angles ^ and i^ of the low free energy conformers 
(AF^OQK < 6 kJ/mol) are within the allowed regions of the Ramachandran plot (Figure 4). The sin- 
gle outlier for ADPA in the fourth quadrant represents the C terminal residue Ala4 of a conformer 
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Figure 2: By an exhaustive conformational search we predicted the low free energy ensembles for AAPA in isolation 
and in the presence of Li""" and Na"*". The criterion applied to select the structures shown is lowest free energy, except 
for conformer 0-1-2-4(11) which was selected for its relation to 0-1-2-4(1): both conformers can be interconverted by 
a backbone crankshaft movement. It is noteworthy how potential energy {AE, in kJ/mol) and harmonic free energy 
{AFj,ooK, in kJ/mol) hierarchies differ for the peptide ion complexes. The preference for cis (red background) and trans 
(blue background) depends on the cation complexated by AAPA. The simulated IR spectra are shown as continuous 
lines for the individual conformers as well as for the assumed ensemble of conformers (lowest row), always in com- 
parison to the experimental IR spectra in dashed lines. The tables show the relative proportion of each conformer 
within the respective mixed simulated spectra. Simulated spectra were shifted along the energy axis by a value A for 
an optimal Pendry reliability factor Rp. The atom colors: C is gray, N is blue, O is red, H is white, Li is green, and Na 
is orange. Hydrogens are only shown if part of H bonds, n 
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Figure 3: Low free energy conformers for peptide ADPA in isolation and in the presence of Li""" and Na"*". Potential 
energy (AE, in kJ/mol) and harmonic free energy (AF^qqk, in kJ/mol) are given. The prolyl peptide bond of the re- 
spective lowest free energy conformer is trans (blue background) for all three cases. The conformational ensembles of 
ADPA with Li"*" or Na""" are very similar, which is also indicated by the similarity of the experimental IR spectra (dashed 
Unes). Bottom: The simulated spectra (continuous lines) were mixed to account for a conformational ensemble, the 
tables show the relative proportion of the conformers within the mixed spectra. Simulated spectra were shifted by a 
value A along the energy axis for an optimal Pendry reliability factor Rp. The atom colors: C is gray, N is blue, O is 
red, H is white, Li is green, and Na is orange. Hydrogens are only shown if part of H bonds. 



with AF200K =4.8kJ/mol. The different possible rotameric states of the Asp sidechain prefer 
different backbone conformations. This results in more possible backbone conformations (data 
points) compared to AAPA. The cation-peptide interaction imprints ^/i// combinations (backbone 
conformations) that substantially differ from those of the unperturbed peptides. Some of them with 
still low relative free energy (0.9 kJ/mol for AAPA + Li"*" to 2.6 kJ/mol for AAPA + Na"*") are even 
located outside of the allowed regions of the Ramachandran plot (Figure 4). These outliers are not 
at the termini but in the central residues Ala2 or Asp2, which govern the overall structure of the 
peptides. Interestingly, also the cation effects on the two peptides differ. It becomes obvious that 
the conformational ensembles of AAPA with Li"*" and Na"*", respectively, differ (Figures 2 and 4), 
while for lithiated or sodiated ADPA the possible conformations seem very similar (Figures 3 and 
4). 

A canonical turn structure, type j8//' (not shown), is the lowest PES minimum of AAPA+Li"*". 
The second most stable minimum, with A£'= 0.2 kJ/mol, is a a-tum (Figure 2). Here, the con- 
sideration of harmonic free energy contributions changes the picture dramatically and 'unusual' 
backbone conformations become dominant. In the lowest free energy conformer, the Li"*" cation 
is coordinated by three backbone carbonyl groups of residues 0, 2, and 4 (Figure 2). We base 
the naming of the conformers on the peptide cation interaction by using the numbers of the inter- 
acting oxygens, e.g. 0-2-4. In case of multiple conformations with the same interaction pattern, 
they are distinguished with roman numerals, increasing with the free energy of the conformers. 
Up to four out of five possible binding partners (backbone carbonyl groups) are sterically possi- 
ble (conformers 0-1-2-4 with AF3ooa:=0.9 or 2.2kJ/mol). Although the search for minima does 
not yield information on the actual barriers connecting different conformers, their high structural 
similarity suggests dynamic interconversion at finite temperature. For AAPA-i-Li"*", the preferred 
conformation of the prolyl-peptide bond changes from cis to trans. 

Na"*" binding to AAPA results in a similar behavior: Canonical structure types (jSV/, j8//', 
a) are lowest in potential energy while structures with 'unusual' backbone conformations and 
carbonyl groups pointing towards Na"*" are most stable when harmonic free energy contributions 



are considered. However, there are substantial differences to the Li"*" adducts: the low free energy 
ensemble is more diverse and the central peptide bond of the lowest free energy conformer 0- 
1-3-4 is cis. In the case of AAPA -i- Na"*" (Figure 2), we do not show just the four lowest free- 
energy conformers (see Table 3 of the Supporting Information for all calculated free energies). 
The second lowest free energy conformer, A 1661 (AF3ooA:=2.6kJ/mol), was excluded since it was 
proven unstable in the subsequent AIMD simulations for IR spectra (see section below). Instead 
we consider 0-1-2-4(11) as fourth conformer, which is much higher in free energy. Interestingly, 
these two conformers of AAPA-t-Na"*", 0-1-2-4(1) and 0-1-2-4(11), are almost identical besides the 
orientation peptide bond between Pro3 and Ala4 (Figure 2). This peptide bond is not involved in 
any interactions and can thus rotate by a concerted motion of the adjacent torsion angles i// and 
(j), a so-called backbone crankshaft rotation.EZES During the equilibration AIMD simulations at 
300 K (in preparation of the simulations to obtain IR spectra), this interconversion from 0-1-2-4(1) 
to 0-1-2-4(11) was indeed observed within the 10 ps simulation time. The subsequent evaluation of 
IR spectra also suggests the presence of 0-1-2-4(11) in the experimentally observed conformational 
ensemble. 

In ADRA., the dominant interaction pattern is the complexation of Li"*" or Na"*" by the backbone 
oxygens 0, 2, 4 and the Asp sidechain carboxyl group. All conformers in the low energy range 
are highly similar and feature no cis prolyl peptide bonds (Figure 3). As discussed above on the 
basis of the Ramachandran plot (Figure 4), the effects of the cations on AAPA and ADPA differ. 
Li"*" enforces a trans conformation on the prolyl peptide bond of AAPA while Na"*" enforces the cis 
conformation (Figure 2). For ADPA, no such selectivity for the cation is observed. With Li"*" or 
Na"*" attached, similar structure types with trans prolyl peptide bonds are preferred (Figures 3 and 
4). 

Infrared spectroscopy 

In order to corroborate our structural findings, we obtained gas-phase infrared multi-photon disso- 
ciation (IRMPD) spectra, which reflect the same clean-room conditions as used in our simulations. 
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Figure 4: The backbone torsion angles ^ and i// of the low free energy conformers (AF300A: < 
6 kJ/mol) for AAPA and ADPA in isolation (gray squares), with Li"*" (green triangles), and with Na"*" 
(orange triangles) were plotted on top of an empiric contour plot.^^ Addition of cations disturbs the 
backbone conformations and even results in usually forbidden ^/i//. The backbone conformation 
of AAPA is sensitive to the type of cation, as illustrated by the differing torsion angle patterns with 
Li"^ and Na+. 
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Spectra were recorded from 1000 to ISOOcm^ at the free electron laser facility FELIXES! using a 
Fourier- transform ion cyclotron (FT-ICR) mass spectrometer™ The experimentally obtained spec- 
tra for lithiated and sodiated AAPA and ADPA are shown in Figure 2 and Figure 3. For AAPA 
significantly different spectral signatures where obtained for the Li"*" and Na"*" complexed forms, 
which is in line with the results of the coformational analysis described in the previous section. 
For ADPA on the other hand very similar spectra were recorded with both cations. 

In order to allow for a quantitative theory-experiment comparison, IR spectra including an- 
harmonic effects were computed from Born-Oppenheimer ab initio molecular dynamics (AIMD) 
simulations. The systems were equihbrated by 10 ps of AIMD simulations at 300 K. Subsequently, 
the micro-canonical ensemble was sampled in up to 40 ps long AIMD simulations at constant 
energy from which IR spectra were derived.l^ZEII ir spectra of polyamides feature characteristic 
bands of high intensity (like the amide I and II region, 1400 - 1700 cm^ ^ ) but also regions with low 
intensity (below 1400cm^^) and fingerprint characteristics. Visual inspection does not allow for 
a quantitative assessment and is, similar to a simple square of intensity comparison, easily biased 
by the high intensity peaks. For a quantitative comparison between the calculated and experimen- 
tal spectra, we employ the reliability factor Rp introduced by Pendry^ to the field of low-energy 
electron diffraction, in an implementation distributed with referencel43l For Rp, peak positions are 
more important than peak intensities - a characteristic that fits the requirements we face here, espe- 
cially as we compare experimental action spectra and theoretical absorption spectra. Values for Rp 
range from (perfect agreement) to 1 (no correlation). Intensities of the spectra were normalized 
to 1 and rigidly shifted (not scaled) with a value A along the energy axis to account for deviations 
likely due to a systematic mode-softening by the density functional we uscl^SSIwhen comparing 
the calculated IR spectra of single conformations to the experimental IR spectra we observe only 
modest agreement (see individual spectra in Figures 2 and 3). Previous studies have shown similar 
behavior due to conformational ensembles for peptides in the gas-phase at finite temperaturcl^S^ 
Furthermore, the energy differences of the low free-energy conformers lie within the uncertainty 
of the employed method, as discussed in the benchmarks section above. Consequently, an ensem- 
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ble of conformations is assumed. By mixing the individual theoretical spectra in 5% steps, the 
Rp to the respective experimental spectrum is optimized. This results in a much better agreement 
of simulated and experimental spectra of the peptides AAPA and ADPA in complex with single 
Li"*" or Na""" cations (Figures 2 and 3). For the predicted spectra of AAPA + Li"*" and for ADPA 
+ Li"*" or Na"*", especially the reproduction of the fine structure below 1400 cm^ wavenumbers is 
gratifying. We note for completeness that the spectra for the protonated peptides (not shown) are 
rather different in appearance, suggesting very different structural effects compared to the heavier 
cations. 

In a naive way, a correlation between the free energy estimates at the harmonic approximation 
and the abundances of the individual spectra in the resulting mixed spectrum could be expected. 
However, this would be too much to expect for several reasons: 

• The PBE+vdW method we use is rather accurate as illustrated by the benchmark calculation 
presented in the methodology section of this article, however, the systems under investigation 
here are also large (56 to 60 atoms). The lowest free-energy minima we discuss here are still 
within the possible uncertainties of the relative (free) energies. 

• The experimental data base to which we are comparing mixes of theoretical spectra is, sim- 
ply, small - and fitting many parameters to a small data set has well known limitations.SZI 
What these mixes offer is, therefore, strictly only a consistency check. The spectra of just a 
single conformer are not sufficient to explain the observed IR spectra. In contrast, confor- 
mational mixes yield a much more consistent description of the spectra, in line with several 
conformers of similar free energy. This is the primary quantitative statement that we can 
derive from the experiment-theory comparison. 

• The free energy model neglects anharmonicity as well as the entropic effects of a possibly 
greater accessible conformational space (dynamical interconversion in the case of low bar- 
riers) of specific conformers. That the latter can be of special importance is illustrated by 
the crankshaft rotation discussed above for the two conformers 0-1-2-4(1) and 0-1-2-4(11) 
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of AAPA + Na"*". The fact that both conformers can interconvert shows the extent to which 
the anharmonic nature of the potential-energy surface can play a role. In fact, in the com- 
bination of the four individual spectra that shows the best agreement with the experimental 
spectrum (Figure 2), 0-1-2-4(11) is predominant with 45%. Again, conformer 0-1-2-4(11) 
is structurally and dynamically closely related to the 0-1-2-4(1) conformer with the lowest 
harmonic free energy. This illustrates the limits of the harmonic free energy assignment to 
potential energy minima at room temperature, where such conformational and dynamical 
effects are neglected. It furthermore illustrates the limitations of the interpretation of IR 
spectra as a combination of individual and isolated conformers. 

Concluding this section we can say that, on the one hand, the accuracy of the harmonic approx- 
imation to the free energy is limited by the dynamic character of such molecular systems at finite T. 
On the other, the IRMPD spectroscopy setup we use here is limited in its resolution, especially re- 
garding the separation of individual conformers. However, we can unambiguously predict minima 
by first-principles theory and validate the results by room temperature IR spectroscopy (keeping 
the differences of static harmonic free-energy minima and actual room temperature molecules in 
mind). The observed cation-peptide effects were certainly qualitatively corroborated by both ap- 
proaches. 

Micro solvation of a peptide cation complex 

Already in the introduction we mention the presence of turn- sequences mainly at the surface of 
proteins, exposed to the aqueous environment. With this section we want to give at least a qualita- 
tive picture of how the interaction between the peptide backbone and the cation can compete with 
solvation of the cation. AIMD simulations have been performed for AAR^-i-Li"*" alone and with a 
few water molecules. For the setup of the latter system, 18 molecules were accommodated within 
a sphere of 4.5 A radius around the Li"*" cation. For comparison, Li"*" embedded in 4 and 10 water 
molecules, respectively, was also studied. We characterize the interaction between the Li"*" cation 
and the respective oxygens of the peptide backbone or of first solvation shell water molecules by 
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the Li"'"-0 distance and by the 0-Li"'"-0 angle, shown in Figure 5. Previous ab initio studies predict 
a coordination number of 4 for Li"*" in waterl^SHIESl Consistently, the cation is complexed by 4 
backbone carbonyl groups in our example conformer 0-1-2-4(1). During a 100 ps AIMD trajectory 
at 330 K (Nose-Hoover thermostat), the Li"'"-0 distance fluctuates around 1 .9 A, the 0-Li"'"-0 angle 
distribution is broad, indicating the non-ideal tetrahedron formed by the interacting carbonyl oxy- 
gens. The microsolvation of AAPA -i- Li"*" in 18 water molecules results in a slight change of the 
binding site within a few picoseconds: a water oxygen substitutes for the interaction to backbone 
C=0 of Alal. The cation interacts with the three backbone carbonyl oxygens of AAPA and the 
same water molecule (Figure 5) for the whole 90 ps of remaining AIMD simulation time. As a 
result, a virtually ideal binding site is formed, characterized by an almost symmetric distribution 
of the 0-Li"'"-0 angle around the ideal tetrahedral angle of 109.5°. For Li"*" immersed in a small 
water cluster (4 or 10 water molecules, respectively), the Li"'"-0 distance distribution peaks around 
2.0 A. Remarkably, the distribution of the tetrahedron angles 0-Li"'"-0 is multimodal again, ac- 
counting for alternative (and less populated) geometries of the Li"*" complexation involving 3 or, 
in the case of the 10 water with Li"*" cluster, even 5 water molecules in the first solvation shell. 
For now, we can at least qualitatively say that AAPA is able to form an ideal interaction shell that 
seems competitive to water solvation. A fully correct answer could be given on the basis of free 
energy differences from simulations with fully solvated systems. Such simulations are standard 
for force field approaches, yet they are computationally very demanding at the level of theory we 
employ here. A rigorous assessment is thus beyond the scope of this manuscript. 

Conclusion 

Starting from the isolated peptides that adopt either canonical turn structures (AAPA) or turn- 
like conformations with sidechain to backbone H bonds (ADPA), we show the drastic effect of 
cations on the local secondary structure of peptides: the cation attracts most of the backbone 
carbonyl groups and, as a result, completely breaks the local H bonding network. This leads to 
distortions of the peptide backbone and results in conformations with backbone torsion angles 
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AAPA Li* (0-1-2-4 (I)) 




1.6 1.8 2.0 2.2 2.4 2.6 2.8 3.0 
Li*"-0 Distance in A 



80 100 120 140 160 

0-Li*-0 Angles in deg 



Figure 5: The first solvation shell around a Li"*" cation. Interactions are formed between between 
the oxygens of water molecules (purple) or backbone carbonyl groups (red). The histograms were 
derived from AIMD simulation of different length (20 to 100 ps) and the counts were normalized 
to 1. 
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and \f/ that are in part outside of the allowed regions of the Ramachandran plot (Figure 4). 
Consequently the question of the range of such ion-induced disruptions arises. Ohanessian and co- 
workers studiedS^MIpQJy.gjy^lj^gg ^^^j^ ^ chain length of 2 to 8 in complex with Na"*" by simulation 
and gas-phase IR spectroscopy: Up to a sequence of 7 glycine residues the contact number between 
the cation and backbone C=0 groups is maximized and no H bonding was observed. With the 
Glyg peptide, backbone H bonding appeared again in form of y- and /3- turns. Glycine, due to 
the lack of a sidechain, represents a very special case among the canonical amino acids. As a 
contrast, the helical secondary structure of sodiated polyalanine (8-12) is not broken in the gas 
phase. Here, the Na"*" ion is attached to the C terminus.^^^*^ The importance of considering the 
effect of sidechain functionalities is highlighted by the sequence dependence of the cation effects 
we observe. The conformational preferences of AAR^ with Li"*" or Na"*" differ drastically by the 
trans or cis state of the central prolyl peptide bond. Noskov and Roux investigated the selectivity 
of the ion-coupled transporter LeuT. Two Na"*" binding sites (NAl and NA2) show differences in 
the Li"'"/Na"'" selectivity: NAl appears to be rather flexible and exhibits no selectivity for one cation 
over the other as it adapts to the different ionic radii. For NA2, a slight selectivity is apparently 
induced by a "snug-fit" mechanism (the rigid NA2 interaction site is unable to adapt to different 
ionic radii) .1211 Similarly, the lowest free-energy structure 0-1-3-4 of AAR^-i-Na"*" may be to rigid to 
adapt to the Li"*" cation, since only backbone carbonyl functions can be involved in the interaction. 
With ADR^., the Asp sidechain prevents such conformation selectivity. 

Our findings might even help to understand a basic biochemical principle: In 1888, Hofmeister 
published an articlelS^that laid the basis for a sorting of cations and anions according to their effect 
on the solubility of biomolecules, colloids, and functional polymers. While it was believed that 
the underlying effects can be explained solely by bulk properties stemming from the solvent ion 
interactions, 123 evidence was found that most effects of ions on water structure are limited to the 
first solvation shell'SS and that specific ion- solute interactions can be expected to contribute sub- 
stantially.^^ This gets especially clear at high salt concentrations as investigated by Dzubiella and 
co-workers employing classical MD simulationsSSlSIl and later also experimental approaches-l^Sl 
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They demonstrate that the perturbing effect of ions on peptide structure results from the breaking 
secondary structure specific H bonds in the backbone. Our own findings point into a similar di- 
rection, as we have shown here how cations can substantially change the backbone structure of a 
(bio)polymer. These interactions are not necessarily stable over a very long time range, but our 
exploratory AIMD simulations suggest time ranges at least in the tens to hundreds picoseconds. 
Dzubiella describes long-lived loop conformations that are stable over 10 to 20 ns in classical MD 
trajectories. 123 Similar to the specific interactions between anions and the amide bond containing 
polymers of N-isopropylacrylamide described by Cremer and co-workers ,1^ we show here the pos- 
sible interactions between small mono-valent cations and peptides and highlight their significant 
effect on local peptide structure. Such should be considered as one driver behind the Hofmeister 
salt effects on proteins. 

Computational methods 

Scans of the PES were peformed with a basin hopping-like exhaustive search and conventional 
protein force fields (OPLS-AA^Sor AMBERQQI^. We employ the TINKER 5 scan routinely in 
an in-house parallelized version. The required methods to perform DFT-based simulations, includ- 
ing geometry optimization, computation of harmonic vibrations, and ab initio Born-Oppenheimer 
molecular dynamics (AIMD), are incorporated in the FHI-aims code, which provides an efficient 
and accurate all-electron description based on numeric atom-centered orbitals.lSSln the following 
we discuss fully relaxed conformations at the PBE-i-vdW level and their relative potential energies 
(AE) and relative harmonic free energies at 300K (AFjqok), all computed with tight convergence 
settings and an accurate tier 2 basis set.'^D High-level quantum chemical benchmark calculations, 
i.e. relaxations at the MP2 level of theory and coupled-cluster calculations with singles, dou- 
bles and perturbative triples (CCSD(T)), were performed with the ORCA quantum chemistry pro- 
gram, ^^ CCSD(T) energies extrapolated to the complete basis set limit (CBS) were obtained by a 
method described by Truhlar,'S2 employing the Dunning basis sets cc-pVDZ and cc-pVTZ.'SSl 
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Benchmarks 

We assessed the predictive power of the DFT approximations applied here by benchmarks in two 
directions with respect to the approximation level: We compare to high-level quantum chemistry 
at the CCSD(T) level of theory extrapolated to the complete basis-set limit. On the other hand, 
we assess the quality of the force field description of cation-peptide interactions in comparison to 
approximate DFT at the PBE-i-vdW and PBEO-i-vdW levels. 

Comparing electronic structure theory methods 

There have been several assessments of the accuracy of the PBE-i-vdW level of theory applied to a 
variety of systems, e.g., peptides, ^^ weakly bound metal-phtalocyanine systems,'55' and ionic and 
semiconductor solids.'^ A previous assessment of the accuracy of PBE-i-vdW for peptide systems, 
for the conformational energy hierarchy of Ace-Ala-NMe and Ace- Alas -NMe, shows mean abso- 
lute errors (MAE) below 2 kJ/mol in comparison to CCSD(T) energies. ^^ We here investigate metal 
cation-peptide systems and thus re-assess the accuracy of our DFT-based predictions. We employ 
high-level quantum chemical theory on the conformational energy hierarchy of Ac-Ala-NMe+Li"*". 
A conformational analysis identified five local minima (Figure 6) within a potential energy range 
of 35kJ/mol at the MP2/cc-pVTZ level of theory.l^S^ The cation closes a 7-membered pseudo- 
cycle via interaction with the oxygens of the backbone carbonyl groups. The orientation of the 
methyl groups relative to the pseudo-ring plane defines them as either equatorial (Figure 6: 1, 3, 5) 
or axial (Figure 6: 2, 4). In addition, an important characteristic of our actual systems of interest 
(AAPA and ADPA) is present here as well: in the lower energy conformers 1 and 2 the C-terminal 
peptide bond is trans configured, in contrast to the other conformers with a C terminal cis peptide 
bond. 

Relative energies at the CCSD(T), PBE-i-vdW, and PBEO-i-vdW level of theory were compared 
by estimating the mean absolute error (MAE = -E/Li \fi — yi\)- As an additional method, we also 
include the frequently-used B3LYP method (no vdW correction). For much larger systems, the 
missing description of dispersion effects represents a deficiency in the description of conforma- 
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Figure 6: Lowest energy conformers of Ace-Ala-NHMe + Li"*", fully relaxed at the MP2/cc-pVTZ 
level of ab initio theory. Hydrogens were omitted for clarity; dashed black lines illustrate the 
oxygen lithium interactions. 
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tional energy hierarchies. The DFT methods give low uncertainties (see Table 1), well within the 
often stressed "chemical accuracy" of 1 kcal/mol (4.2kJ/mol). The conformational changes upon 
relaxation with DFT are negligible, as indicated by the low maximal RMSD value of 0.38 A (Ta- 
ble 1). We note that only small contributions of the van der Waals correction can be expected for 
molecular systems of this size. Of the tested DFT approaches, PBEO+vdW gives the best agree- 
ment with the benchmark calculations, as it is obvious from the MAE and the average RMSD (see 
Table 1). However, for a large-scale conformational screening and the extensive molecular dynam- 
ics simulations we undertake in this study, PBE-i-vdW perfectly balances computational costs and 
accuracy. 

Table 1: Relative energies and RMSD values of the conformers depicted in Figure Figure 6. Left 
columns: CCSD(T), PBE-i-vdW, PBEO-i-vdW, and B3LYP relative energies were calculated for 
MP2/cc-pVTZ geometries. The MAE of the DFT relative energy hierarchies to CCSD(T) is also 
given. Right columns: the conformers were also relaxed with the respective DFT methods. With 
respect to the MP2 geometries, RMSD values for the individual conformers and average RMSD 
values are given. Relative energies and the mean absolute errors (MAE) are given in kJ/mol; 
RMSD values are given in A. 







^E (MP2 g 


eometries) 




RMSD to MP2 




Conf. 


CCSD(T) 


PBE+vdW 


PBEO-i-vdW 


B3LYP 


PBE+vdW 


PBEO-i-vdW 


B3LYP 


1 


0.0 


0.0 


0.0 


0.0 


0.03 


0.03 


0.03 


2 


6.7 


4.4 


5.0 


5.7 


0.03 


0.03 


0.03 


3 


20.2 


17.1 


18.2 


22.3 


0.38 


0.16 


0.33 


4 


23.1 


22.0 


23.1 


27.1 


0.12 


0.08 


0.18 


5 


33.8 


34.2 


34.8 


38.0 


0.04 


0.02 


0.04 


MAE/I 
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2.3 


0.12 
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Standard protein force fields versus electronic structure theory 

When comparing the results of different standard protein force fields among each other and to 
DFT, we observe dramatic discrepancies in the conformational hierarchies. Such force fields were 
parametrized for the solvated state, while we perform our assessment in the gas-phase. Nonethe- 
less, they are frequently used also for conformational investigations irrespective of the environ- 
ment. Consequently, their performance in vacuo is of interest. We employ as a reference the 
conformational energy hierarchy of AAPA-t-Li"*" at the PBE-i-vdW level. In line with the results of 
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the above comparison, PBE^^ and the hybrid density functional PBE()52l (both vdW corrected'221) 
give very similar results, illustrated by the low mean absolute (MAE) and maximal errors (E,nax) 
listed in Table 2. Both approaches without the vdW correction give higher MAE and Emax- The 
widely used protein force fields Amber99,^^ Charmm22, 1221 and OPLS-AA-^^ give MAE that are at 
least about 15 times larger with a dramatic magnitude of the maximal errors (see Table 2). The 
relative energies can be found in the Supporting Information. A main characteristic of the system 
is apparently the cation peptide interaction. The effect on the partial charges appears to be better 
described by the polarizable FF Amoeba,'— 'illustrated by a MAE of about lOkJ/mol. The removal 
of the cation leads to reduced MAE values for the FF methods (cf. Table 2), also the energy 
hierarchies themselves appear more consistent in the different methods. This can be seen when 
comparing the two plots (with and without Li"*") in Figure 7 or when looking at the maximal errors 
given in Table 2. The calculations (single point) were repeated for the same AAPA conformers 
(fixed geometries) but without the cation. The MAEs to the force field approaches are consistently 
much larger than between the DFT techniques, with significant errors especially also in the en- 
ergetic hierarchy of the conformers. Apparently, the large errors of the force fields can mainly 
be attributed to the ill-described cation-peptide interaction. In short, DFT based approaches for 
cation-peptide systems appear to be vastly preferable at least over the standard force field-based 
approaches tested here. 

Table 2: MAE and Emax, with respect to the PBE-i-vdW hierarchy, of the energy hierarchies com- 
puted with PBEO-i-vdW, PBE, PBEO, Amoeba,!^ Amber99,'^4 Charmm22,^o ^^d OPLS,'23for the 
fixed geometries with and without the Li"*" cation (energies in kJ/mol). The relative energies can be 
found in Tables 1 and 2 of the Supporting Information. 

PBEO-i-vdW PBE PBEO Amoeba Amber99 Charmm22 OPLS-AA 

Fixed AAPA-i-Li+ geometries 

5.7 6.1 9.7 15.7 20.5 26.0 

10.9 11.1 22.3 30.1 53.6 69.1 

Same geometries, fixed without Li+ 

6.6 6.6 8.1 14.0 15.6 11.6 

14.3 14.8 20.3 37.5 36.8 29.7 
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Figure 7: The relative energies of 21 AAPA-t-Li"*" conformers with a relative potential energy below 
lOkJ/mol at the PBE-i-vdW level were re-calculated with PBEO+vdW, PBE, PBEO, Amoeba,!^ 
Aniber99,'^^ Charmm22,'22l and OPLS,'^^ for the fixed geometries with and without the Li"*" cation. 
The conformational energy hierarchies with different FF and DFT methods are plotted against the 
in PBE-i-vdW values (;c-axis). The relative energy values were shifted according to the overall 
offset of the individual energy hierarchy. Please note the different scales of the x-axes in the plots 
with and without Li"*" cation. The hypothetical perfect correlation is indicated by the straight lines, 
but all force fields deviate significantly even for the qualitative energetics. 
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Experimental methods 
Synthesis 

Peptides were synthesized by solid phase assembly using a Multi-Syntech Syro XP peptide syn- 
thesizer (MultisynTech GmbH, Witten, Germany) by Fmoc strategy on Fmoc-Ala-OWang resin 
(0.5 mmol/g). The peptides were cleaved from the resin by reaction with 2 ml of a solution contain- 
ing 10% (w/v) triisopropylsilane, 1% (w/v) water, and 89% (w/v) TFA. The crude peptides were 
purified by reversed-phase HPLC on a Knauer smartline manager 5000 system (Knauer GmbH, 
Berlin, Germany) equipped with a C8 (10 jUm) LUNATM Phenomenex column (Phenomenex Inc., 
Torrance, CA, USA). Peptides were eluted with a linear gradient of acetonitrile/water/0.1% tri- 
fluoroacetic acid and identified on an Agilent 6210 ESI-TOF mass spectrometer. Peptide purity 
was determined by analytical HPLC on a Merck LaChrom system (Merck KGaA, Darmstadt, Ger- 
many) equipped with a C8 (10 /im) LUNATM Phenomenex column (Phenomenex Inc., Torrance, 
CA, USA). The gradient used was similar to those of the preparative HPLC. 

Infrared spectroscopy 

The gasphase IR experiments were performed at the free electron laser facility FELDd221(Nieuwegein, 
The Netherlands) using the Fourier-transform ion cyclotron (FT-ICR) mass spectrometei®! which 
was temporarily equipped with a nano electrospray ionization source (MS Vision, Almere, The 
Netherlands). Typically, 5 /il of a solution containing 1 mM peptide, 50% water, 50% methanol 
and, where needed, lOmM LiCl or NaCl, were placed in gold-coated, off-line emitters prepared 
in-house. In order to obtain a stable spray, a small backing pressure of approximately 0.5 bar and a 
relatively low capillary voltage of approximately 850 V was applied to the needle. The nESI gen- 
erated ions were accumulated in a hexapole ion trap and subsequently transferred into the FT-ICR 
mass spectrometer that is optically accessible via a KRS-5 window at the back end. After trap- 
ping and SWIFT mass-isolation inside the ICR cell, the ions were irradiated by IR photons of the 
free electron laser FELIX. ^^ The light provided by FELIX consists of macropulses of about 5 jUs 
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length at a repetition rate of 10 Hz, which contain 0.3 to 5ps long micropulses with a micropulse 
spacing of 1 ns. The wavelength is continuously tunable over a range of 40 to 2000 cm ^ Here, 
typically wavelengths from 500 to 1850cm^^ were scanned. When the IR light is resonant with an 
IR active vibrational mode in the molecule, this results in the absorbance of many photons, which 
causes dissociation of the ion (IRMPD). Monitoring of the fragmentation yield as a function of IR 
wavelength leads to the IR spectra. 
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